BIOTECHNOLOGY gfefl gig ' 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: . Q/f£ 

Date Processed by STIC: ///Z/Qo&z* 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703- 308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTED 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www>uspto,gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility thafcthe disk/CD-Rom may have been affected by treatment given to all incoming mail 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom, including: 

1. EFS-Bio (<http:/Avww.uspto,gov/ebc/efs/downIoads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, r other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 



\ 



Raw Sequence Listing Error Summary 



ERROR DETECTED 
ATTN:/*EW RULES CASES: 



1 v Wrapped Nucleics 
Wrapped Aminos 



2 Invalid line Length 

3 ^ Misaligned Amino 

Numbering 

4 Non-ASCII 



Variable Length 



Patentln 2.0 
"bug" 



(NEW RULES) 



SUGGESTED CORRECTION SERIAL NUMBER: 

PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping." 

The rules require that a line not exceed 72 characters in length. This includes white spaces. 

The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text. 

Sequences) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



Skipped Sequences Sequences) 
(OLD RULES) 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



8 Skipped Sequences Sequences) 



<210> sequence id number 
<400> sequence id number 
000 



missing. If intentional, please insert the following lines for each skipped sequence. 



Use of n*s and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 
In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence ftuleS, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 

Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses. 

Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
- • rr- "Unknown." Please explain source of genetic material in <220> to <223> section 

(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

12 Patentln 2.0 Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 

"bug" resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 

listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



9 Use of n's or Xaa's 

(NEW RULES) 



10 Invalid <213> 
Response 



11 Useof<220> 



13 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 

AMC/MH - Biotechnology Systems Branch - 08/21/2001 



OIPE 



RAW SEQUENCE LISTING DATE: 01/15/2002 

PATENT APPLICATION: US/10/023 , 066 TIME: 19:01:05 



Input Set : A:\BB1037 Sequence Listing.txt 
Output Set: N:\CRF3\01152002\J023066.raw 



SEQUENCE LISTING -yA I ) 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: E. I. DU PONT DE NEMOURS AND 

6 COMPANY 



4+ 



JjJUX,^ INCREASING THE LYSINE / ^ jU^M^ 

Axj^Jb^J&s AND THREONINE CONTENT \ If} > 

OLu^f^^^ QF THE SEEDS 0F plants .JL**^ ju-+^ "yt&u^ yUu/^J 

OF SEOtJENCES : 1 07 V fc , 



8 (ii) TITLE OF INVENTION: CHIMERIC GENES AND _ DOGS Npt Comply 

9 methods for increasing Corrected Diskette Needed 

10 

11 

W--> 12 

14 (iii) NUMBER OF SEQUENCES: 107 ^ w s / 

16 (iv) CORRESPONDENCE ADDRESS: JuUt^* MsOU 

17 (A) ADDRESSEE: E. I. DU PONT DE NEMOURS i f \ 

18 AND COMPANY aJL0U#\ J 

19 (B) STREET: 1007 MARKET STREET 

20 (C) CITY: WILMINGTON 

21 (D) STATE: DELAWARE 

22 (E) COUNTRY: U.S.A. 

23 (F) ZIP: 19898 

25 (V) COMPUTER READABLE FORM: 

26 (A) MEDIUM TYPE: FLOPPY DISK 

27 (B) COMPUTER: IBM PC COMPATIBLE 

28 (C) OPERATING SYSTEM: PC-DOS/MS-DOS . 

29 (D) SOFTWARE: MICROSOFT WORD VERSION 2 . 0C 

31 (vi) CURRENT APPLICATION DATA: 

32 (A) APPLICATION NUMBER: US/10/023 , 066 
--> 33 (B) FILING DATE: 17-Dec-2001 

34 (C) CLASSIFICATION: 

36 (viii) ATTORNEY/AGENT INFORMATION: 

3 7 (A) NAME: BARBARA C. SIEGELL 

38 (B) REGISTRATION NUMBER: 30,684 

39 (C) REFERENCE/DOCKET NUMBER: BB-1037-C 

41 (ix) TELECOMMUNICATION INFORMATION: 

42 (A) TELEPHONE: 302-992-4931 

43 (B) TELEFAX: 302-773-0164 

44 (C) TELEX: 835420 



ERRORS D SEQUENCES 

234 (2) INFORMATION FOR SEQ ID NO: 6: 

236 (i) SEQUENCE CHARACTERISTICS: 

237 (A) LENGTH: 917 base pairs 

238 (B) TYPE: nucleic acid 

239 (C) STRANDEDNESS: single 

240 (D) TOPOLOGY: linear 

242 (ii) MOLECULE TYPE: DNA (genomic) 

244 (ix) FEATURE: 




file://C:\CRF3\Outhold\VsrJ023066.htm 



1/15/02 



RAW SEQUENCE LISTING DATE : 01/15/2002 

PATENT APPLICATION: US/10/023 , 066 TIME: 19:01:05 



Input Set : A:\BB1037 Sequence Listing.txt 
Output Set: N:\CRF3\01152002\J023066.raw 



94 S 






(A) NAME/KEY: 


CDS 






















246 






(B) LOCATION: 


3. .! 


911 




















94ft 




(Xi) SEQUENCE DESCRIPTION: \ 


SEQ : 


ID NO: 6 














9sn 

Z -J VJ 


CC ATG GCT ACA GGT TTA ACA GCT AAG ACC GGA GTA GAG CAC ' 


TTC GGC 


47 


Z -J _L 


Met Ala ' 


Thr Gly Leu Thr Ala Lys Thr Gly Val Glu His ] 


Phe Gly 




9S9 

Z J Z 




1 








5 










10 










15 




254 


ACC 


GTT 


GGA 


GTA 


GCA 


ATG 


GTT 


ACT 


CCA 


TTC 


ACG 


GAA 


TCC 


GGA 


GAC 


ATC 


95 


955 

Z -J J 


Thr 


Val 


Glv 


Val 


Ala 


Met 


Val 


Thr 


Pro 


Phe 


Thr 


Glu 


Ser 


Glv 

JL 


Asp 


He 




256 










20 










25 










30 






258 


GAT 


ATC 


GCT 


GCT 


GGC 


CGC 


GAA 


GTC 


GCG 


GCT 


TAT 


TTG 


GTT 


GAT 


AAG 


GGC 




259 


Asp 


lie 


Ala 


Ala 


Gly Arg 


Glu 


Val 


Ala 


Ala 


Tvr 

J. J X 


Leu 


Val 


Asp 


Lys 


Glv 




960 








35 










40 










45 








969 
^ u z 


TTG 


GAT 


TCT 


TTG 


GTT 


CTC 


GCG 


GGC 


ACC 


ACT 


GGT 


GAA 


TCC 


CCA 


ACG 


ACA 


191 


9fi 1 


Leu 


Asp 


Ser 


Leu 


Val 


Leu 


Ala 


Glv 


Thr 


Thr 


Glv 


Glu 


Ser 


Pro 


Thr 


Thr 




964 






50 










55 










60 










966 
z u u 


ACC 


GCC 


GCT 


GAA 


AAA 


CTA 


GAA 


CTG 


CTC 


AAG 


GCC 


GTT 


CGT 


GAG 


GAA 


GTT 


939 

Z ~J J 


967 


Thr 


Ala 


Ala 


Glu 


Lys 


Leu 


Glu 


Leu 


Leu 


Lys 


Ala 


Val 


Arg 


Glu 


Glu 


Val 




96 8 

Z U vj 




65 










70 










75 












97 D 
z / u 


GGG 


GAT 


CGG 


GCG 


AAG 


CTC 


ATC 


GCC 


GGT 


GTC 


GGA 


ACC 


AAC 


AAC 


ACG 


CGG 


9R7 
z o / 


971 
z / j. 


Glv 


Asp 


Arg 


Ala 


Lys 


Leu 


He 


Ala 


Glv 


Val 


Glv 


Thr 


Asn 


Asn 


Thr 


Arg 




979 

Z / Z 


80 










85 










90 










95 




974 


ACA 


TCT 


GTG 


GAA 


CTT 


GCG 


GAA 


GCT 


GCT 


GCT 


TCT 


GCT 


GGC 


GCA 


GAC 


GGC 


1 1 5 


275 


Thr 


Ser 


Val 


Glu 


Leu 


Ala 


Glu 


Ala 


Ala 


Ala 


Ser 


Ala 


Glv 


Ala 


Asp 


Glv 




97 fi 
z / u 










100 










105 










110 






97 ft 
z / o 


CTT 


TTA 


GTT 


GTA 


ACT 


CCT 


TAT 


TAC 


TCC 


AAG 


CCG 


AGC 


CAA 


GAG 


GGA 


TTG 


181 

J O J 


97 Q 

Z / _7 


Leu 


Leu 


Val 


Val 


Thr 


Pro 


Tvt 

JL 


Tvr 


Ser 


Lys 


Pro 


Ser 


Gin 


Glu 


Glv 


Leu 




9R0 

Z O VJ 








115 










120 










125 








9 ft 9 
z o z 


CTG 


GCG 


CAC 


TTC 


GGT 


GCA 


ATT 


GCT 


GCA 


GCA 


ACA 


GAG 


GTT 


CCA 


ATT 


TGT 


411 


981 

Z CJ J 


Leu 


Ala 


His 


Phe 


Gly Ala 


He 


Ala 


Ala 


Ala 


Thr 


Glu 


Val 


Pro 


lie 


Cys 




984 

Z O *x 






130 










135 










140 










9R6 
z o u 


CTC 


TAT 


GAC 


ATT 


CCT 


GGT 


CGG 


TCA 


GGT 


ATT 


CCA 


ATT 


GAG 


TCT 


GAT 


ACC 


47Q 


287 


Leu 


Tvr 
x i J - 


Asp 


lie 


Pro Gly 


Arg 


Ser 


Glv 


He 


Pro 


He 


Glu 


Ser 


Asp 


Thr 




9 ft ft 
zoo 




145 










150 










155 












990 

Z 27 U 


ATG 


AGA 


CGC 


CTG 


AGT 


GAA 


TTA 


CCT 


ACG 


ATT 


TTG 


GCG 


GTC 


AAG 


GAC 


GCC 


^27 


9Q 1 

Z 17 J. 




Arg 




T on 
iJcU 


Ser 


Glu 


Leu 


ri U 


1 11X 


T 1 A 

11c 


T on 


t\ J_CL 


Vdl 




Asp 


t\±. a 




9Q9 

Z _7 Z 


160 










165 










170 










175 




9Q4 


AAG 


GGT 


GAC 


CTC 


GTT 


GCA 


GCC 


ACG 


TCA 


TTG 


ATC 


AAA 


GAA 


ACG 


GGA 


CTT 


S7S 


9Q S 

i J J 


Lys 


Gly 


Asp 


Leu 


Val 


Ala 


Ala 


Thr 


Ser 


Leu 


He 


Lys 


Glu 


Thr 


Gly 


Leu 




9 Qfi 
z u 










180 










185 










190 






z y o 


GCC 


TGG 


TAT 


TCA 


GGC 


GAT 


GAC 


CCA 


CTA 


AAC 


CTT 


GTT 


TGG 


CTT 


GCT 


TTG 




299 


Ala 


Trp 


Tyr 


Ser 


Gly Asp 


Asp 


Pro 


Leu 


Asn 


Leu 


Val 


Trp 


Leu 


Ala 


Leu 




300 








195 










200 










205 








302 


GGC 


GGA 


TCA 


GGT 


TTC 


ATT 


TCC 


GTA 


ATT 


GGA 


CAT 


GCA 


GCC 


CCC 


ACA 


GCA 


671 


303 


Gly 


Gly 


Ser 


Gly 


Phe 


He 


Ser 


Val 


He 


Gly 


His 


Ala 


Ala 


Pro 


Thr 


Ala 




304 






210 










215 










220 










306 


TTA 


CGT 


GAG 


TTG 


TAC 


ACA 


AGC 


TTC 


GAG 


GAA 


GGC 


GAC 


CTC 


GTC 


CGT 


GCG 


719 


307 


Leu 


Arg 


Glu 


Leu 


Tyr Thr 


Ser 


Phe 


Glu 


Glu 


Gly 


Asp 


Leu 


Val 


Arg 


Ala 




308 




225 










230 










235 












310 


CGG 


GAA 


ATC 


AAC 


GCC 


AAA 


CTA 


TCA 


CCG 


CTG 


GTA 


GCT 


GCC 


CAA 


GGT 


CGC 


767 



file://C:\CRF3\OutholcKVsrJ023066.htm 



1/15/02 



RAW SEQUENCE LISTING DATE : 01/15/2002 

PATENT APPLICATION: US/10/023,066 TIME: 19:01:05 



Input Set : A:\BB1037 Sequence Listing.txt 
Output Set: N:\CRF3\01152002\J023066.raw 



E--> 



311 

312 

314 

315 

316 

318 

319 

320 

322 

323 

324 

1741 

1743 

1744 

1745 

1746 

1747 

1749 

1751 

1752 

1753 

1754 

1755 

1757 

1759 

1760 

1762 

1763 

1980 

1982 

1983 

1984 

1985 

1986 

1988 

1990 

1991 

1992 

1994 

1995 

1997 

1998 

1999 

2000 

2001 

2002 

2003 

2004 

2005 



Arg Glu lie Asn Ala 
240 

TTG GGT GGA GTC 
Leu Gly Gly Val 



Lys 
245 
TTG 
Leu 



AGC 
Ser 
260 

CCT CGA 
Pro Arg 



AAC GTA GGA GAT 
Asn Val Gly Asp 
275 

CTT GAG GCT CTC CGA GAA 

Leu Glu Ala Leu Arg Giu 
290 

(2) INFORMATION FOR SEQ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 



Leu 


Ser 


Pro 


Leu 


Val Ala Ala Gin Gly Arg 












250 




255 






AAA 


uL 1 


GCT 


CTG CGT CTG CAG 


GGC 


ATC 


815 


Ala 


Lys 


Ala 


Ala 


Leu Arg Leu Gin 


Gly 


He 










265 




270 






CTT 


CCA 


ATT 


ATG 


GCT CCA AAT GAG 


CAG 


GAA 


863 


Leu 


Pro 


He 


Met 


Ala Pro Asn Glu 


Gin 


Glu 








280 




285 








GAC 


ATG 


AAA 


AAA 


GCT GGA GTT CTA 


TAA 


TGAGAATTC 


Asp 


Met 


Lys 


Lys Ala Gly Val Leu 


* 








295 






300 








! ID 


NO: 


67: 













protein 

Protein 
1. .28 



(2) 



(D) OTHER INFORMATION: /label= name 
/note= "(SSP 5)4" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
15 10 

Ala Met Glu Glu Lys Met Lys 

*tr is ^ ~ 

FOR SEQ ID NO: 76: 
SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 
(vii) IMMEDIATE SOURCE: 
(B) CLONE: 5-1 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2. .172 

(D) OTHER INFORMATION: /function- "synthetic 
storage protein 
/product= " protein " 
/gene= "ssp" 
/standard_name= 
"5.5.5.7.7.7.7.5" 



Glu Lys Met Lys 

INFORMATION 
(i) 



(ii) 
(vi) 



Met Glu 



file://C:\CRF3\Outhold\VsrJ023066.htm 
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RAW SEQUENCE LISTING DATE: 01/15/2002 

PATENT APPLICATION: US/10/023 , 066 TIME: 19:01:05 



Input Set : A:\BB1037 Sequence Listing.txt 
Output Set: N:\CRF3\01152002\J023066.raw 

2007 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

2009 C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46 

2010 Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 



2011 
2013 
2014 
2015 
2017 
2018 
2019 
E--> 2021 





1 








5 








10 








15 




GAG 


GAG 


AAG 


ATG 


AAG 


GCG 


ATG 


GAG 


GAA 


AAG CTG 


AAA 


GCG 


ATG 


GAG 


GAG 


94 


Glu 


Glu 


Lys 


Met 


Lys 
20 


Ala 


Met 


Glu 


Glu 


Lys Leu 
25 


Lys 


Ala 


Met 


Glu 
30 


Glu 




AAA 


CTC 


AAG 


GCT 


ATG 


GAA 


GAA 


AAG 


CTT 


AAA GCG 


ATG 


GAG 


GAG 


AAA 


CTG 


142 


Lys 


Leu 


Lys 


Ala 
35 


Met 


Glu 


Glu 


Lys 


Leu 
40 


Lys Ala 


Met 


Glu 


Glu 
45 


Lys 


Leu 




AAG 


GCC 


ATG 


GAA 


GAG 


AAG 


ATG 


AAG 


GCG 


TGATAG 










( 


179 


Lys 


Ala 


Met 
50 


Glu 


Glu 


Lys 


Met 


Lys 
55 


Ala 
















(2) 


INFORMATION 


FOR 


SEQ 


ID NO: ! 


98: 

















2022 
2023 
2515 

2517 (i) SEQUENCE CHARACTERISTICS: 

2518 (A) LENGTH: 59 base pairs 

2519 (B) TYPE: nucleic acid 

2520 (C) STRAND EDNESS : single 

2521 (D) TOPOLOGY: linear 

2523 (ii) MOLECULE TYPE: DNA (genomic) 



2525 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

E--> 2527 CATGGCGCCC ACCGTGATGA TGGCCTCGTC GGCCACCGCC GTCGCTCCGT TCCAGGGGC 3 ' 

2528 59 \ ■ : ' — '— — — : ~ _ 

2530 (2) INFORMATION FOR SEQ ID NO: 99: 
2532 (i) SEQUENCE CHARACTERISTICS 



W--> 2528 59 ■ — . 



is 



2533 (A) LENGTH: 59 base pairs £vaW 

2534 (B) TYPE: nucleic acid Ij^^M^ 

2535 (C) STRANDEDNESS: single SF^^ ( 

2536 (D) TOPOLOGY: linear 

2 53 8 (ii) MOLECULE TYPE: DNA (genomic) 

2540 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: oQ 
E~> 2542 TTAAGCCCCT GGAACGGAGC GACGGCGGTG GCCGACGAGG CCATCATCAC GGTGGGCGC J7^^ ' 
W--> 2543 59 ^ — — — : ' — — — — '- i U^*^ 

2700 (2) INFORMATION FOR SEQ ID NO: 107: 

2702 (i) SEQUENCE CHARACTERISTICS: 

2703 (A) LENGTH: 29 base pairs 

2704 (B) TYPE: nucleic acid 

2705 (C) STRANDEDNESS: single 

2706 (D) TOPOLOGY: linear 

2708 (ii) MOLECULE TYPE: DNA (genomic) 

2710 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

2712 CTCTCGGTAC CTAGTACCTA CTGATCAAC 29 



E--> 271 



file://C:\CRF3\Outhold\VsrJ023066.htm 
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VERIFICATION SUMMARY DATE: 01/15/2002 

PATENT APPLICATION: US/10/023,066 TIME: 19:01:06 



Input Set : A:\BB1037 Sequence Listing.txt 
Output Set: N:\CRF3\01152002\J023066.raw 



L:32 M:220 C: 
L:33 M:220 C: 
L:12 M:259 W: 
L:176 M:336 W 
L:322 M:254 E 
L:776 M:336 W 
L:843 M:336 W 
L:1380 M:336 W: 
L:1763 M:332 E: 
L:2021 M:254 E: 
L:2527 M:254 E: 
L:2528 M:336 W: 



Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 
Keyword misspelled or invalid format, [(B) FILING DATE:] 
Allowed number of lines exceeded, (ii) TITLE OF INVENTION: 
Invalid Amino Acid Number in Coding Region, SEQ ID: 3, 
No. of Bases conflict, Input: 918 Counted: 917 SEQ: 6- 
Invalid Amino Acid Number in Coding Region, SEQ ID: 29 
Invalid Amino Acid Number in Coding Region, SEQ ID: 31 
Invalid Amino Acid Number in Coding Region, SEQ ID: 53 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 67 
No. of Bases conflict, Input: 179 Counted: 175 SEQ: 76 
No. of Bases conflict, Input :0 Counted: 59 SEQ: 98 
Invalid Amino Acid Number in Coding Region, SEQ ID: 98 
L:2542 M:254 E: No. of Bases conflict, lnput:0 Counted:59 SEQ:99 
L:2543 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:99 
(46) "n" or "Xaa" used, for SEQ ID#:104 

(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS 
Wrong sequence grouping, Amino acids not in groups! 



L:2638 M:341 W: 
L:2714 M:334 W: 
L:2714 M:333 E : 
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